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Abstract 

It becomes m.orc difficult to explain the social information transfer phenomena using the classic models 

based merely on Shannon Information Theory (SIT) and Classic Probability Theory (CPT), because the 

transfer process in the social world is rich of semantic and highly contextualized. This paper aims to use 

^ twitter data to explore whether the traditional models can interpret information transfer in social net- 

l~~5 works, and whether quantum-like phenomena can be spotted in social networks. Our main contributions 

^+ are: (1) SIT and CPT fail to interpret the information transfer occurring in Twitter; and (2) Quantum 

interference exists in Twitter, and (3) a mathematical model is proposed to elucidate the spotted quantum 

I I ' phenomena. 

Q Introduction 

Shannon Information Theory (SIT) 1 1 and Classic Probability Theory (CPT) quantify the information by 
►^ encoding information as symbols and ignoring their semantic aspects. Such a strategy is very successful in 

^—{ capturing the essential structure and dynamics of information transfer, therefore becoming the backbone 

00 of the modern telecommunication and network transmission technology. As the Web2.0 increases in 

>0 popularity, our current ways of communicating are changing dramatically. Facebook has become gathering 

^--; spot in our daily lives by connecting like-minded folks and establishing virtual communities to solve 

C^^ problems and accomplish tasks. News and comments about natural disasters (e.g. Japans Tsunami) and 

^^ political uprisings (e.g. Middle Easts antigovernment protests) travel with a lightning speed in Twitter. 

^^ As a result, the focus of information transfer gradually move from its technological aspect to social aspect, 

• • where the semantic and human factor becomes so important that cannot be neglected. 

It becomes more and more difficult to explain the social information transfer phenomena using the 
classic models based merely on CPT and SIT, because the transfer process in the social world is highly 
^ contextualized. The influence of possible channels of flowing information in the complex and yet dynamic 

social networks is sophisticated enough that any classic probabilistic model might hardly satisfy. Many 
recent studies are devoted to social information transfer in different social networks [2}|5] and several 
interesting findings have been observed. However, none of them attempted to disclose the insufficiency 
of SIT and CPT to interpret some aspects of social information transfer or proposed a new model that 
that may better serve as a supplementation to the existing classic models. 

Recently Purdue University received $25 million funding from U.S. National Science Foundation to 
create the Science of Information Center to move beyond Shannon theory. They aim to develop principles 
to encompass concepts of structure, time, space, and semantics to aid better understanding of social 
networks and social media behaviors pj. The information is no longer the Is or Os of binary code and its 
meaning can be contextually interpreted based on different temporal, spatial and semantic factors. The 
value of information can change dramatically over time and the flow of information can heavily depend on 
how people use and trust these channels. The Web is becoming more complicated and dynamic, currently 
there is no good way to measure how information is transferred and evolved on the Web. SIT only cares 
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about the physical essence of the information but ignore its semantic essence. 

It is time to rethink how information is diffused through social networks so that we can better 
understand the essential difference between social information transfer and physical information transfer. 
The origin of information transfer in social networks comes from social influence, which occurs when an 
individual's thoughts, feelings or actions are affected by other people. Information transfer characterizes 
the way that a node in social network can spread information to its neighbor nodes via exerting social 
influence to them. Consequently, those neighbor nodes can continually influence their own neighbor 
nodes to further spread that information. Given the complexity in social information transfer, some 
well-established conclusion in SIT and CPT may not hold any more. 

According to SIT, there is a default assumption in information transfer that all information transfer 
channels are independent and there are no interferences among them. However, this default assumption 
might be incorrect. For example, if a node in social network is influenced by two neighbor nodes, and 
tries to make decisions based on the information obtained from the two neighbors, it is natural for this 
node to consider the two neighbors together rather than think of them independently. In other words, 
the information transferred via the two neighbors might have an interference effect on each other. Such 
interference between different information channels is very similar to the interference between different 
light waves that has already been explained by Quantum Theory (QT) ff\. Grounded on a mathematical 
basis, QT bears the potential to model and calculate the contextual and semantic information associated 
with one entity, which takes into account not only the independent features of this entity but also the 
dependent features in different contexts and the interference of these dependent features. 

Whether quantum-like phenomena can be spotted in social networks is the question we hope to explore. 
To address it, we specifically studied the information transfer in Twitter and our main contributions 
include: 

• discovered the phenomenon that the amount of information transfer in Twitter NOT always monoton- 
ously increases along with number of channels, which is incompatible with some conclusions in SIT. 



• 



proposed a quantum version of q-attention model that is capable of mathematically interpreting 
the conflict between decreased information transfer and increased channels 



Methodology 

Information Transfer in Social Network 

In a social network, there're no clearly defined information source, channel and receiver, because each 
agent in the network is a node. According to different situations, each node can act as a source, or a 
channel or a receiver. Figure IT] shows different information transfer patterns: 1) one-channel information 
transfer pattern: A is the source while C is the receiver, and information flows from Aio C via a channel 
created by an intermediate node B; 2) two-channel information transfer pattern: A is the source while C 
is the receiver, and information flows from Aio C via two channels created by intermediate nodes Bi and 
-B2, respectively; and 3) n-channel information transfer pattern: A is the source while C is the receiver, 
and information flows from A to C via a number of channels created by a list of intermediate nodes Bi , 
B2,..., Bn- A simple research question can be specifled as: How does the total amount of information 
transferred from A to C change with the increase/decrease of the number of available channels ? 

Information transfer in Twitter 

Twitter users send and read messages called tweets. One user can follow other users and retweet their 
tweets. Such retweeting relationships connect Twitter users and form the social network where informa- 
tion is transfered via retweeting, as identified by RT@username in tweets. Assume ^ is a Twitter user 
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Figure 1. Information transfer patterns 



who has two followers Bi and i?2- -Bi and B2 have some common followers and C is one of them. One 
tweet coming from user A can be retweeted by both Bi and B2- If Bi or B2 retweets A's tweets, their 
tweets contain RT @A. Here, we assume that C does not directly follow A and can only see j4's tweets 
retweeted by Bi or i?2- So, C can decide whether to further retweet A's tweets from Bi or i?2- Those 
retweets posted by C contain the sign as: RT@Bi : @A or RT@B2 : @^ which means that C retweets 
A's tweets via Bi or B2. 

Now we define the retweeting probability to measure the amount of information transferred from one 
user to another through retweeting, directly or indirectly. If A's tweets are retweeted by B, then the 
retweeting probability is the percentage of A^s tweets retweeted by B. For example, if A posted 100 
tweets and 20 of them are directly retweeted by B. Then, the retweeting probability of B from A is 
20/100 — 0.2, marked as 'P{B\A) = 0.2. Consider the situation where an intermediate node B exists 
between A and C (Figure flVa)). If A posted 100 tweets and 20 of them are retweeted by B. Of the 20 
retweets, 5 are further retweeted by C. Then the retweeting probability of C from A via B, is 5/100 = 
0.05, marked as P(C|A; B) = 0.05. Note that it is different from the case that C directly retweets A, in 
which no intermediate node B exists. 

If the number of channels between A and C increases to two(Figure[I[b)), we can define three retweet- 
ing probabilities: P{C\A;Bi) indicates the probability that A's tweets are retweeted by Bi and further 
retweeted by C as if B2 does not exist; P(C|A; B2) indicates the probability that A's tweets are retweeted 
by B and further retweeted by C as if B2 does not exist; P(C|A;i3i, B2) indicates the probability that 
A's tweets are retweeted by either of Bi or B2, and further retweeted by C. 

In general, as is shown in Figurefljc), we can define n separate retweeting probabilities (e.g., P(C|A; Bi), 
P(C|A;i?2), ...,P{C\A]Bn)) and an overall retweeting probability, i.e. P(C|A; i3i,B2, ..., B„). The rela- 
tionship between P{C\A; Bi, B2, ■■■, Bn) and n separate probabilities, i.e., P{C\A;Bi),...,P{C\A-,Bn) is 
of our interest. 



Q-attention Model 

To determine the relationship between retweeting probabihty and the number of channels, we proposed 
a q-attention model based on CPT by taking into account human cognition (Figure l2|. The q-attention 
model originates from Batchelder's work ^ presenting a family of processing models for the source- 
monitoring paradigm in human memory, but is specifically modified to cater to our information transfer 
research in Twitter. The model is designed for the situation in which agent A posts a tweet that is 
received by a group of agents Bj, j = 1,2, ...n, who may or may not choose to retweet it to agent C. 
There is no direct connection between A and C and the tweets from A can only reach C through one of 
the agents Bj. We examine this probability as a function of the number, denoted n, of agents Bj who 
both see A's tweets and retweet them to C. It is true that there is no universal way to build a classic 
model based on CPT and human cognition for information transfer and our q-attention model is just one 
of many possible models. 

We assume that agent C has some limits on capacity for the number of tweets that can be considered. 
Consequently agent C can only pay attention to a total number of N intermediate agents forming a total 
set T = {Bi, i?2, ---Bn} . A subset of n < A^ of these agents Sn — {Bi, ..., i?„} C T receive A's tweets 
and also retweet them to C. In other words, C has total of N channels to receive message, but only n of 
them transfer tweets from A. A^ is a fixed number but n is a variable. 




Figure 2. q-attention model 

Define qj as the probability that agent C pays attention to the tweets from agent Bj e T, and we 
require X^i^i 1j = 1- The joint probability that agent Bj is selected for attention and then agent C 
decides to retweets from A passed along by Bj equals to qj ■ P{C\A; Bj). Finally, for some set of agents 
Sn , the probability that agent C retweets from agent A through any one of the members in Sn is obtained 
from the sum of path probabilities and also known as the law of total probability in CPT: 



P{C\A-Sn)^ J2 1j-PiC\A;B, 



(1) 
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For simplicity we use P(n) to represent P{C\A; Sn)- In order to calculate Equation 111 we need to 
know qj and P{C\A; Bj),j = 1,2, ...n. Actually qi are free parameters which can be hardly estimated 
from our data. A^ is the maximum number of channels C can pay attention to, which satisfy that 



P(N+k) = P{N), k e N+. We consider the initial values P(l) can be directly observed from the real data 
because the interference occurs only when n > 1. In addition, we assume that qi = ~_5i! , i = 2,3, ...,n, 
and P(C|j4; Bj) = p for all channels. Finally, the q-attention model can be represented by a piecewise 
defined function: 

'P(7i) = P(l) + ^p ■n,n = 2,3, ..., N-1 

P{N + k) =P(7V),fceN+ ^ ' 

Now our task is to estimate the parameters of q-attention model in order to calculate P{n) in Equation^ 
Before reaching to the plateau, P(n) is just a monotone increasing linear function, with positive slope 
;;^^p and through a fixed initial point P(l). Actually, it is very difficult to estimate qi,N and p from 
real data. Thus, instead of estimating those parameters separately, we will use linear regression approach 
based on least square principle through a fixed point P(l) to fit a linear curve based on data points from 
the real Twitter data. Then the curve can be used to predict P(n). 

It is important to note that according to this classic model, P(C|^; Sn) increases monotonously with 
the variable n. In other words, adding more channels can never decrease the probability. 

Results 

We based our calculation on two tweets datasets using different crawling methods. One dataset is used 
to check the information transfer from a global scale, since the tweets come from public timelines. By 
contrast, the other dataset serves as a case study of information transfer from a local scale, since the 
tweets comes from an ego network. 

Dataset 1 

The first dataset contains 467 million tweets from 20 million public users for the time period from June 
to December 2009 [^ which covers 20%-30% of total public tweets published on Twitter during that 
time frame. Only those tweets containing RT <§usernaniel: RT @username2 are selected. Then we 
grouped those tweets into different patterns based on the number of channels (See Figure [l]). Figure [s] 
shows the distribution of the amount of information transfer patterns with different number of channels 
in datasetl. The distribution is skewed towards the information transfer patterns with relatively small 
number of channels. As the number of channels increases, the number of instances drops significantly. It 
implies that the occurrence of patterns with a large number of channels is relatively rare. 

The largest number of channels we found in our data is 29. However, we only selected one-channel 
through six-channel patterns to present our results, for two reasons: 

• The occurrence of patterns with a large number of channels is relatively rare. When the number 
of channels become very large, their results can be subject to random disturbance which yields 
unreliable observations. 

• The process of information transfer, described in q-attention model, consists of the linear increasing 
phase and the plateau phase. Only the increasing phase is relevant to the problem under study. 

The retweeting probability of an instance of n-channel pattern in Twitter data is calculated by: 

J2 No. of tweets containing 'RT@Bj : RT@A' 

pfn) = ^^^^" (3) 

No. of tweets A posted 



-"^ http://snap.stanford.cdu/data/#twitter 
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Figure 3. No. of instances in different information transfer patterns in datasetl 



To present the real retweeting probability for each transfer pattern, we take the average value of 
retweeting probabilities calculated from all instances belonging to that transfer pattern. And the q- 
attention model results are just the linear regression of the real retweeting probabilities of different 
patterns. Figure l4] compares the retweeting probability from the real data (noted as real value), with 
that from the q-attention model. Surprisingly, there are two drops in retweeting probability as the 
number of channels increases: a big one from 1 to 2, and a small one from 5 to 6. It is totally acceptable 
that the our q-attention model value cannot exactly match the real value because of the statistical errors. 
However, the observation that the amount of information occasionally decrease as the number of channels 
increase cannot be attributed to pure statistical errors, because it is contradictory to the one of the main 
conclusions in SIT, that increasing channels always increase the capacity of information being transferred. 
The results from datasetl provide the global view about information transfer in Twitter, which q-attention 
model based on CPT cannot explain. 



Dataset2 

The second dataset was crawled by members in Knowledge Engineering Lab at Tsinghua University. 
This dataset covered time frame from Aug. 1st to Dec. 12th, 2009. Crawling started from a specific 
user (@yanglicai, a relatively popular user in Chinese Twitter community) of Twitter, and algorithm was 
designed to check all his contacts involved in replies and retweets, who were not necessarily his followers 
or followees. Crawling proceeds in a breadth-first approach and results in 192,999 tweets from 8,254 
users, and 25.5% of all tweets collected are retweeting messages. We can view dataset2 as an ego network 
with @yanglicai in the center and all other nodes having interactions with @yanglicai. 

Figure [5] shows that the average number of records in dataset2 is much smaller than datasetl, because 
the tweets in dataset2 come from a special ego social network. The largest number of channels we found 
in dataset2 is 21 and we also selected one-channel to six-channel for analysis. In Figure |6J the difference 
of retweeting probability between the real data (noted as the real value), and the q-attention model is 
shown. Similar to datasetl, two drops are spotted: from patternl to pattern2, from pattern4 to pattern5. 
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Figure 4. Comparison between real value and classic model in datasetl 
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Figure 5. No. of instances in different information transfer patterns in dataset2 



The results from dataset2 provide the local scale about information transfer in Twitter, which further 
confirm the contradictory phenomenon in information transfer as we saw in datasetl from the global 
scale. 

Based on the results from the above two datasets, we found that the amount of information transfer 
drops occasionally even though the number of information transfer channels increases. It is contradictory 
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Figure 6. Comparison between real value and classic model in dataset2 



to SIT and CPT, which assumes that the increase of channels never decrease the information transfer. 
Someone may challenge our observation since the q-attention model is only one of many approaches to 
model information transfer and we take several assumptions to further simplify our model into a lineal 
function. Admittedly, using different classic model may yield different results. However, as long as the 
model is built on the law of total probability from CPT, the result will be consistent with the conclusion 
that increasing channels never decrease the information transfer amount proved in SIT. Therefore, the 
contradictory phenomenon we found in Twitter data is independent of which model we use. 



Discussion 



The main deficit of our proposed q-attention model lies in the assumption that the n information transfer 
channels are independent. However, such assumption tends to be counter intuitive. Let's see a concrete 
example. Assuming that C is a teacher and A is his student. C wants to write a personal evaluation 
about A and hope to know after-school behaviors about A. First, C obtains some of A's information 
via one of A's classmates Bi. Then, in order to know more about A, C decides to inquiry A's another 
classmate i?2- 

Assuming that the amount of information C gets from A through each single channel, i.e. from Bl 
or B2, is the same, can we definitely claim that C obtains more information about A after inquiring _B2? 
Not necessary. For example, if C asks Bi whether A has a sister and Bi says yes, then C tends to judge 
A has a sister. However, if C asks B2 the same question but B2 says no, then how could C make his 
judgement? As a result, C may be confused and won't know whether A has a sister or not. In this case, 
for C, the information about A actually decreased when the number of information channels increases 
from one to two. The reason is that the information provided by two channels is not independent but have 
mutual interference in semantic. Moreover, such interference not only exist in the semantic aspect but 
also in the contextual aspect. Specifically, who provides the information, as well as when and where that 
information is provided, all contribute to the interference effect. QT provides an approach to model such 
complex inference which beyond the scope of SIT and CPT that only consider the quantity of information 



but ignore its semantic and contextual aspects. 

Next we will introduce the quantum interference phenomenon in a physics experiment and how we 
can make an analogy between quantum interferences in physics and in information transfer. Then we 
will propose a quantum version of q-attention model to mathematically interpret the conflict between the 
increased number of channels and decreased amount of information transfer. 

Prom Double-slit Experiment to Information Transfer 

Quantum interference was first observed in a double-slit experiment by Thomas Young in 1803 [7] . Small 
quantum particles or waves pass through two slits interfered with each other and generated a pattern of 
bright and dark bands. The beams emerging from the two slits are coherent as they come from the same 
source (Figure [7|. 
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Figure 7. The double-slit experiment 



Now we introduce the social information transfer in the context of double slit experiment in Figure [7] 
Image that particles are shot from the lamp towards the two slits 5'i and 5*2. Once the particle passes 
through ^i or 5*2, they hit the detector panel, positioned behind the screen, in a particular location x with 
probabihty PsiSa i^)- By closing slit 5*2, it is possible to measure the probability of particles being detected 
in position x passing through ^i , namely Ps^ (x). Similarly, by closing j ust slit 5*1 , we can measure Ps^ (x) . 
Based on CPT, we would expect that the probability particles being detected at x when both slits are open 
is the sum of the probability of passing through A and being detected at x, Pg^ (x) , and the probability 
of passing through B and hitting the detector panel in x, Pg^ix). Let's make an analogy between particle 
shooting and information transfer defined in SIT. The lamp is equivalent to information source, the 
particles are equivalent information, the two slits are equivalent to two information channels, and the 
position X in the screen is equivalent to information receiver. The amount of information transferred from 
the source to x is measured by probability. According to SIT, the total capacity of information transfer 
is the sum of capacity of each channel, i.e. Ps^sA^) = Psii^) + Ps2{x). Although the information is 
actually measured by bit in SIT, the law of total probability still holds. 

If a third slit S^ is open and particles can also reach x in the panel through it. To measure Ps^{x), 
we need to close both 5*1 and 5*2. As a result, the total information transferred will be PsiSaSala^) = 
Psi{x) -\- Ps2{x) + Ps^{x). We defined a function Pn{x) representing the information transfer probability 
with the number of channels. Obviously, Pn{x) is a monotone increase function. However, there should 
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exist a upper limit upon the total amount of information transfer, because not all slits on the wall can let 
particles pass through and reach position x. Similarly, any real information receiver is subject to some 
receiving limit that increasing the information channel cannot further increasing the information amount 
as long as the limit is reached. In the q-attention model, such limit is considered as human's capacity of 
paying attention to a finite number of channels. 

As we mentioned above, the monotonous increase property of Pn{x) is built on CPT and confirmed 
in SIT. However, the double-slit experiment demonstrates the existence of quantum interference effect. 
In other words, PsiS2 [x) ^ Ps^ {x) + Pg2 {^)- ^^ particular, Psi5'2 {^) "^^^^ be zero even though Ps^ [x) > 
and Ps2{x) > 0. 

Now let's see how quantum-like interference may possibly occur in social information transfer. In 
Figure |l(a)[ the information is transferred from source A to receiver C via one channel connecting 
through B. There can be no quantum interference effect since only one information channel exists. C 
has no other alternative channel to obtain A's information except for B. However, when there is more 



than one channel, interference among these channels becomes possible. For example, in Figure 1(b) C 
can obtain information about A from either Bi or i?2. According to SIT, path A-Bi-C and A-B2-C 
are independent. In other words, the amount of information C obtained from A, is just the sum of 
amount of information form A to Bi to C as if B2 does not exist, and that of from A to B2 to C as 
if Bi does not exist. Because it assumes that Bi and B2 are totally unrelated from C's perspective. 
This is also consistent with SIT: more channels always tend to provide more information and therefore 
reduce uncertainty (entropy) about an event. However, we believe it is oversimplified in social network 
because the semantic of information are neglected. Assuming that the information content from Bi and 
B2 are contradictory, then C's uncertainty about A will only increase instead of decreasing. In other 
words, the total information transferred from A to C actually decreases according to C's cognition, when 
more channels are available. Here the quantum interference takes effect, because C tends to compare 
the information from different channel and make decisions. In addition, by considering the context of 
information channels, C is more likely to trust the channel which exerts more social influence upon him. 
In essence, the SIT only focuses on the quantity of information being transferred but ignore its semantic 
and contextual perspectives. 

Quantum Q-attention Model 

Since the q-attention model based on CPT cannot interpret the conflicts between increasing channel 
and decreasing information transfer, now we formulate a quantum version of the q-attention model, by 
taking into account the possible interference between channels. According to the probability defined in 
QT [9] , a probability p of an outcome is not primitive but derived from something more primitive called 
a probability amplitude '0 (a complex number) - the probability is obtained by squaring the magnitude 
of the amplitude, p = |V'| . There are three differences between the quantum q-attention model and the 
classic version. 

The first difference is that the classic probability of paying attention to agent Bj , previously denoted 
qj , is replaced by a probability amplitude denoted Tpj . The agent C is assumed to be in a superposition 
state, denoted by 

N 

|^„)=^0,.|i3,), (4) 

which represents this agent's potential to consider any one of the tweets from agents Bj in T, especially 
including the tweets posted from agent A passed along by the agents Bj in T. The probability amplitude 
ipj represents the 'potential' to consider the tweets of agent Bj, but this potential is represented by a 
complex number; the probability of paying attention to the tweets of agent Bj is obtained from the 
squared magnitude qj = IV^jl and again we require J2i=i \^j\ — 1- 
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The second difference is that the classic probabihty that C retweets the from A through Bj, previ- 
ously denoted P(C|A; Bj) is replaced by the probability amplitude (C|A; Bj). The probability amplitude 
{C\A]Bj) represents the 'potential' to retweet, but this potential is represented by a complex number; 
the probability of retweeting is obtained from the squared magnitude 'P{C\A\Bj) — \{C\A\Bj)\ and 
once again we require P(C|j4; Bj) + P(C'|^; Bj) — 1. 

The third difference is that QT obeys the law of total amplitude rather than the law of total probability 
from CPT. We define (C|S'„) as the probability amplitude that the agent C retweets the message from 
A by considering tweets from agents Bj in T. The probability that agent C retweets from agent A passed 
along by agents Bj in 5'„ then equals P(C|A;S'„) = |(C|A;5'„)| . According to QT, to determine the 
amplitude (C|A; Sn) j we replace the sum of path probabilities shown in Equation [I] with the sum of path 
amplitudes given below 

(C|A;5„)= Y. ^j-{C\A-,B,). (5) 

Predictions for n=l,2,3 

Let us see some examples of how our q-attention model encompass the interference effect. First consider 
n = 1 in which case there is only one agent Si — {Bi} . Then in this case P(l) = \4'i ■ {G\A; Bi)\'^ = 
\ipi\ ■ \{C\A; Bi)\ . The probability that C retrweets A through this one agent is virtually the same as 
the classic version and no quantum interference occurs in this case. 

Next consider n = 2 in which case there are two agents 5*2 = {Bi,B2}. In this case Equation [s] 
produces 

P(2) = \^p,-{C\A;B,)+ij2-{C\A;B2)f 

= |Vi-(C|v4;i3i>|V|V2-(C|A;i?2)|' 

+ {ri-^2)-{C\A;B,)*-{C\A;B2) 

+ {^i-r2)-{C\A;Bi)-{C\A;B2)*. 

The first two terms correspond to the same probabilities that one would obtain from the classic model. 
The last two terms form a conjugate pair, 

{tPl ■ ^2) ■ {C\A; Bi>* • {C\A; B^) - |^i • V2 • {C\A- B{) ■ {C\A- B^) \ ■ (cos (^12) + z • sin (^12)) 
(Vi • ^;) • {C\A; B,) ■ {C\A; B^)* = |^i • V2 • {C\A- B,) ■ {C\A; i?2)| • (cos (^12) - z • sin (^12)) , 

where 9i2 is the phase of each complex number which depends on the pair of agents {Bi, B2} ■ The sum 
of this conjugate pair produces a real number called the interference term: 

Inti2 = 2 • 1^1 • i/;2 • {C\A; Bi) ■ {C\A; B2) \ ■ cos (^12) . 

The cosine term can be positive (producing constructive interference), negative (producing destructive 
interference) , or zero (producing no interference) . If the cosine is sufficiently negative, then the probability 
of C indirectly retweets from A when given two paths can be smaller than the probability given a single 
path. This happens whenever |V'2 • (C'l^! ^2)! + Int < 0. 

When there is no interference, cos (Ou) ~ 0, then the probability that C retweets from A based on 
two intermediate agents must be larger than the probability based on a single agent. This follows the 
fact that when there is no interference, the probabilities from each path sum and exceed the single path 
just like the classic probability model. 

Now consider the case with n = 3 and ^3 — {i?i, B2, B^} . In this case we obtain 



P(3) = \^,-{C\A;Bi)+^P2-{C\A;B2)+^3-{C\A;B3)\ 
\^i-{C\A;Bi)\'' + \4,: 
+Inti2 + /nti3 + Int23 



2 



= 1^1 • {C\A;Bi)\^ + \i,2 ■ {C\A-B2)\' + |^3 • {C\A-B3)Y 
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with the interference defined as 

InU, = 2 • 1^, • Vj • {C\A; B,) ■ {C\A; B,)\ ■ cos (%) . 

Each pair of cosines can be positive or negative. If they are all negative, then we could find a decrease in 
probability for three paths as compared to two paths or one path. But if the cosines flip from positive 
to negative, they could cancel out in the sum leaving the result with no overall interference. In general, 
as more path amplitudes are added together there is a tendency for the interference terms to change 
sign and cancel out, which is called decoherence. When decoherence occurs, the quantum model starts 
behaving like the classic model. 



Related Work 



QT has been applied in information retrieval 10 . Furthermore, research demonstrates that quantum-like 



phenomena exist in human natural language, cognition and decision making 11 12 . For example, the 
term bat can be modeled in a two dimensional vector space as a vector representing a linear combination of 
two senses: sports and animal. The same goes for the term boxer: sports and animal. Then the combination 



of boxer bat follows the quantum formalism 11 . Experiments show that many conceCPT combinations 
are non-separable meaning that the combination cannot be modeled by probability distributions across 
the senses of the individual words. The non-linear or non-separable combination of two entities exists 
not only in information retrieval but also other fields which deal with social interaction and cognitive 
interference. Phenomena containing interference cannot be modeled via classical probabilistic models. 
QT offers a probabilistic, logic and geometric formalism based on the mathematics of Hilbert spaces to 
describe the behavior of interference. 

Van Rijsbergen fl3 provided a mechanism to describe logic-based IR models within a quantum for- 
malism, in particular with Hilbert spaces. One of the well-adopted is Logical Imaging (LI) [m]. LI 
calculates a probability that a document is relevant to a query by considering the correlation of terms 
that appears across different documents. Huertas-Rosero, Azzopardi, and van Rijsbergen fl5] proposed 
quantum-based measures for documents by using the selective erasers. Erasers model the relation of one 
term with respect to another by using co-occurrence methods, and the distance between this term and 



other neighboring occurrences. Arafat and van Rijsbergen 10 applied QT to address some fundamental 
issues in search, which investigate and model user cognition and their interactions during the search 
process. Di Buccio, Lalmas, and Melucci [16] proposed a uniform way to model properties of entities, 
relationships of entities, and properties of relationships through a geometric framework in terms of vector 
subspaces 17 . Zuccon, Azzopardi and van Rijsbergen [15] proposed to represent documents and queries 



as subspaces rather than as vectors in traditional vector-space model. Then the relevance between the 
document and user query can be reformulated as how to distinguish preparations of different quantum 
systems, which can be measured using subspace distance. Hou and Song |18 proposed an extended 
vector space model (EVSM) to model context-sensitive high-order information. Zuccon, Azzopardi, and 
van Rijsbergen [15] suggest to rank the document relevance using QT as judgment of relevant is not 
independent from other documents, and the interference of other documents play an important role to 
judge the relevance. They proposed a novel quantum probability ranking principle (QPRP) to model 
situations that a document relevance assessment is influenced by other documents. Interference exists 
in document relevance judgment, especially when users change their relevance measurement for one doc- 



ument after they have measured other documents. Zhang et al. 19 proposed to use quantum finite 
automation (QFA) to represent the transition of the measurement states (the relevance degrees of the 
document judged by users) and dynamically model the cognitive interference of users when they are 
infiuenced by other related documents. Piwowarski, Frommholz, Lalmas, and van Rijsbergen [20j applied 
tensor products from QT to refine the representation of queries (as density operators) and documents (as 
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subspaces). Although QT has been extensively applied to IR and human cognition modeling, but none 
have apply QT to explain the information transfer in soeial networks, such as Twitter. 

Conclusion and Future Work 

In this paper, we studied whether quantum-like phenomena can be spotted in social networks. To address 
it, we constructed two social networks formed by the retweeting relationship in Twitter, proposed a CPT 
based q-attention model to quantify the information transfer by retweeting probability, and compared 
the model results with the real value. We found that CPT and SIT cannot interpret the conflict between 
decreased information transfer and increased channels and proposed a quantum version of q-attention 
model to solve the conflict. 

Although QT provides some promising potential to add semantic and context to classic information 
transfer, a great deal of future work remains. First, we should select more social network datasets, like 
Facebook, Myspace, or Blogs, to test the quantum-like phenomenon. Second, the quantum model we 
proposed only gives the theoretical possibility to explain the quantum-like interference, but does not 
provide empirical interpretation of what the interference are. We may need to apply the social influence 
theory to further explain it. Third, some psychological experiments can be conducted to better understand 
the quantum-like phenomenon. 
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